Heterogeneous information networks, which consist of multi-typed vertices representing objects and multi-typed edges representing relations between objects, are ubiquitous in the real world. In this paper, we study the problem of entity matching for heterogeneous information networks based on distributed network embedding and multi-layer perceptron with a highway network, and we propose a new method named DEM short for Deep Entity Matching. In contrast to the traditional entity matching methods, DEM utilizes the multi-layer perceptron with a highway network to explore the hidden relations to improve the performance of matching. Importantly, we incorporate DEM with the network embedding methodology, enabling highly efficient computing in a vectorized manner. DEM’s generic modeling of both the network structure and the entity attributes enables it to model various heterogeneous information networks flexibly. To illustrate its functionality, we apply the DEM algorithm to two real-world entity matching applications: user linkage under the social network analysis scenario that predicts the same or matched users in different social platforms and record linkage that predicts the same or matched records in different citation networks. Extensive experiments on real-world datasets demonstrate DEM’s effectiveness and rationality.
In this paper, we propose a novel formulation extending convolutional neural networks (CNN) to arbitrary two-dimensional manifolds using orthogonal basis functions called Zernike polynomials. In many areas, geometric features play a key role in understanding scientific trends and phenomena, where accurate numerical quantification of geometric features is critical. Recently, CNNs have demonstrated a substantial improvement in extracting and codifying geometric features. However, the progress is mostly centred around computer vision and its applications where an inherent grid-like data representation is naturally present. In contrast, many geometry processing problems deal with curved surfaces and the application of CNNs is not trivial due to the lack of canonical grid-like representation, the absence of globally consistent orientation and the incompatible local discretizations. In this paper, we show that the Zernike polynomials allow rigourous yet practical mathematical generalization of CNNs to arbitrary surfaces. We prove that the convolution of two functions can be represented as a simple dot product between Zernike coefficients and the rotation of a convolution kernel is essentially a set of 2 × 2 rotation matrices applied to the coefficients. The key contribution of this work is in such a computationally efficient but rigorous generalization of the major CNN building blocks. 相似文献
The parameters governing the crystallisation of paracetamol using various conventional techniques has been extensively studied, however the factors influencing the drug crystallisation using spray drying is not as well understood. The aim of this work was to investigate the crystallisation of an active pharmaceutical ingredient through evaporative crystallisation using a spray dryer to study the physicochemical properties of the drug and to use semi-empirical equations to gain insight into the morphology and particle size of the dried powder. Paracetamol solutions were spray dried at various inlet temperatures ranging from 60 °C to 120 °C and also from a series of inlet feed solvent compositions ranging from 50/50% v/v ethanol/water to 100% ethanol and solid-state characterisation was done. The size and morphology of the dried materials were altered with a change in spray drying parameters, with an increase in inlet temperature leading to an increase in particle Sauter mean diameter (from 3.0 to 4.4 µm) and a decrease in the particle size with an increase in ethanol concentration in the feed (from 4.6 to 4.4 µm) as a result of changes in particle density and atomised droplet size. The morphology of the dried particles consisted of agglomerates of individual crystallites bound together into larger semi-spherical agglomerates with a higher tendency for particles having crystalline ridges to form at higher ethanol concentrations of the feed. 相似文献